Motion Estimation Analysis for Unsupervised Training for Lip Reading User Authentication Systems
نویسندگان
چکیده
This paper proposes a lip reading technique for speech recognition by using motion estimation analysis. Motion estimation is done for lip movement image sequences representing speech. In this methodology, the motion estimation is computed without extracting the speaker’s lip contours and location. This leads to obtaining robust visual features for lip movements representing utterances. Our methodology comprises of two phases, a training phase and a recognition phase. In both phases an n x n video frame of the image sequence for an utterances (can be an alphanumeric character, word or a sentence in more complicated analysis) is divided into m x m blocks. Our method calculates and fits eight curves for each frame. Each curve represents motion estimation of this frame in a specific direction. These eight curves are representing set of features of a specific frame and are extracted in an unsupervised manner. The feature set consists of the integral values of the motion estimation. These features are expected to be extremely effective in the training phase. The feature sets are used to characterize specific utterances with no additional acoustic feature set. A corpus of utterances and their motion estimation features are built in the training phase. The recognition phase is accomplished by extracting the feature set,from the new image sequence of lip movement of an utterance, and compare it to the corpus using the mean square error metric for recognition.
منابع مشابه
Block-Based Motion Estimation Analysis for Lip Reading User Authentication Systems
This paper proposes a lip reading technique for speech recognition by using motion estimation analysis. The method described in this paper represents a sub-system of the Silent Pass project. Silent Pass is a lip reading password entry system for security applications. It presents a user authentication system based on password lip reading. Motion estimation is done for lip movement image sequenc...
متن کامللبخوانی: روش جدید احراز هویت در برنامههای کاربردی گوشیهای تلفن همراه اندروید
Today, mobile phones are one of the first instruments every individual person interacts with. There are lots of mobile applications used by people to achieve their goals. One of the most-used applications is mobile banks. Security in m-bank applications is very important, therefore modern methods of authentication is required. Most of m-bank applications use text passwords which can be stolen b...
متن کاملMouth Localization for Appearance-based Lip Motion Analysis
Analysis of lip motions can be deployed in a variety of applications, e. g. visual speech reading or liveness verification as part of a person authentication system. When utilizing appearance-based features to describe lip shapes (visemes), robustly detecting the position of the mouth center is an inevitable part of this task. In this paper we present an algorithm for mouth localization as part...
متن کاملUnsupervised Extraction of Multi-Frame Features for Lip-Reading
The features of human lip motion from video clips are extracted by three unsupervised learning algorithms, i.e., Principal Component Analysis (PCA), Independent Component Analysis (ICA), and Non-negative Matrix Factorization (NMF). Since the human perception of facial motion goes through two different pathways, i.e., the lateral fusifom gyrus for the invariant aspects and the superior temporal ...
متن کاملUnsupervised Feature Extraction for the Representation and Recognition of Lip Motion Video
The lip-reading recognition is reported with lip-motion features extracted from multiple video frames by three unsupervised learning algorithms, i.e., Principle Component Analysis (PCA), Independent Component Analysis (ICA), and Non-negative Matrix Factorization (NMF). Since the human perception of facial motion goes through two different pathways, i.e., the lateral fusifom gyrus for the invari...
متن کامل